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(54) Title: FIELD PROGRAMMABLE PROCESSOR ARRAYS 
(57) Abstract 

An integrated circuit has a field programmable 
circuit region arranged as a generally rectangular 
array of rows and columns of circuit areas. Some 
of the circuit areas (12) each provide a respective 
processing unit for performing operations on data on 
at least one respective input signal path (an, aw, 
be, bs, hci, vci) to provide data on at least one 
respective output signal path (fn, fe, fs, fw, vco, hco). 
Others of the circuit areas each provide a respective 
switching section (14); and the processing units and 
the switching sections are arranged alternately in 
each row and in each column. Each of a substantial 
proportion of the switching sections provides a 
programmable connection (16, 18, 20) between at 
least some of the signal paths of those of the 
processing units adjacent that switching section in the 
same column and in the same row. A dense layout 
can be obtained with efficient local interconnections, 
especially in the case where one or more of the 
processing units has a plural-bit input and/or a 
plural-bit output, and at least some of the signal 
paths are provided by respective plural-bit busses. 
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TIT LE 

HELD PROGRAMMABLE PROCESSOR ARRAYS 

5 

DESCRIPTION 

This invention relates to field programmable processor arrays, that is, arrays of 
processing units which are selectively connectable by signal paths one to another, and 
10 in particular to such arrays which are provided as integrated circuits. 

The problems with which the present invention (or at least preferred embodiments of it) 
is concerned are to provide a dense circuit layout, efficient interconnections between the 
processing units and flexibility in the manner in which the processing units may be 
IS interconnected. 

In accordance with the present invention, there is provided an integrated circuit having 
a field programmable circuit region arranged as a generally rectangular array of rows 
and columns of circuit areas, wherein: some of the circuit areas each provide a 

20 respective processing unit (such as an arithmetic logic unit) for performing operations 
on data on at least one respective input signal path to provide data on at least one 
respective output signal path; others of the circuit areas each provide a respective 
switching section; the processing units and the switching sections are arranged 
alternately in each row and in each column; and each of a substantial proportion (and 

25 preferably all) of the switching sections provides a programmable connection between 
at least some of the signal paths of those of the processing units adjacent that switching 
section in the same column and in the same row. By arranging the circuit in this way, 
a dense layout can be obtained with efficient local interconnections, especially in the case 
where one or more of the processing units has a plural-bit input and/or a plural-bit 

30 output, and at least some of the signal paths are provided by respective plural-bit busses . 
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A first type of the connections provided by the switching sections are between such 
signal paths which may be generally collinear with or parallel to each other, allowing 
the paths to be selectably concatenated to produce longer connections. In this case and 
in the case where at least one of the processing units and the processing units adjacent 
5 thereto each have a first input, a second input and an output; the output of said one 
processing unit is preferably connectable: by such a first type of connection to the first 
input of the next processing unit in one direction in the same row; by such a first type 
of connection to the first input of the next processing unit in the one direction in the 
same column; by such a first type of connection to the second input of the next 

10 processing unit in the opposite direction in the same row; and by such a first type of 
connection to the second input of the next processing unit in the opposite direction in the 
same column. In this way, the output of a first processing unit can be selectably passed 
to any of the four adjacent processing units in the same row or column ("a second 
processing unit"), where it can be processed and then passed back to the first processing 

15 unit or passed on to any of the other three adjacent processing units adjacent-the second 
processing unit. 



A second type of the connections provided by the switching sections are between such 
signal paths which may be generally orthogonal to each other. Thus the signal paths can 
20 change direction to enable flexible routing. In this case and in the case where at least one 
of the processing units and the processing units adjacent thereto each have a first input, 
a second input and an output, the output of said one processing unit is preferably 
connectable: by such a second type of connection in the same column to the first input 
of the diagonally adjacent processing unit in said one row direction and said one column 
25 direction; by such a second type of connection in the same row to the first input of the 
diagonally adjacent processing unit in said opposite row direction and said one column 
direction; by such a second type of connection in the same column to the second input 
of the diagonally adjacent processing unit in said opposite row direction and said 
opposite column direction; and by such a second type of connection in the same row to 
the second input of the diagonally adjacent processing unit in said one row direction and 
said opposite column direction. Thus, the output from a first processing unit can be 



30 
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selectably routed to the first inputs of two of the diagonally adjacent processing units, 
and to the second inputs of the other two diagonally adjacent processing units. 

Preferably substantially all of the input and output signal paths are oriented in directions 
5 substantially parallel to the rows or the columns, thus enabling a dense layout to be 
achieved. 

The integrated circuit preferably further comprises a plurality of inter switching section 
signal paths, each of which extends from a respective first one of the switching sections 

10 to a respective second one of the switching sections in the same row in a direction 
primarily generally parallel to that row, or in the same column in a direction primarily 
generally parallel to that column, each of the inter switching section signal paths being 
programmably connectable by the respective first switching section to others of &e 
signal paths at that first switching section, and being programmably connectable by ttoe 

15 respective second switching section to others of the signal paths at that second switching 
section. Accordingly, medium and long range connections can be selectably provided. 

For one type of the inter switching section signal paths, there may be no such switching 
sections in the respective row or column between the respective First and second 
20 switching sections. 

For another type of the inter switching section signal paths, the respective first and 
second switching sections may have a number (preferably one less than a powe r of two) 
of other such switching sections therebetween in the respective row or column. Thus, 
25 long range connections are provided reducing the number of intermediate switches which 
may be required, and accordingly reducing the propagation delay which would be caused 
thereby. 

For a further type of the inter switching section signal paths, each signal path may have 
30 a spine portion extending in a direction generally parallel to the respective row or 
column and first and second end portions each extending in a direction generally 
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orthogonal to the respective row or column and interconnecting the spine portion and the 
respective first and second switching sections, respectively. Since the spine portions do 
not connect to the switches directly, the choice of the physical position of the conductors 
in the spine portion is flexible, and this flexibility enables denser layouts to be achieved. 
5 For at least some of the inter switching section signal paths of said further type, the 
respective first and second switching sections may have a number (preferably one less 
than a power of two) of other such switching sections therebetween in the respective row 
or column. Thus, long range connections are provided reducing the number of 
intermediate switches which may be required, and accordingly reducing the propagation 
10 delay which would be caused thereby. At least some of the inter switching section signal 
paths of . said further type preferably each have at least one tap portion extending in a 
direction generally orthogonal to the respective row or column and interconnecting the 
spine portion and a respective such other switching section. Accordingly, even greater 
flexibility is provided. 

15 

At least some of the switching sections preferably each include a respective register 
and/or buffer having an input and an output each switchably connectable to at least some 
of the signal paths at that switching section. This allows signals to be retimed or buffered 
without using one of the processing units for that purpose. Retiming and buffering of 
20 time-critical signals allows configurations to run at higher clock speeds, increasing the 
rate of operation. 

A specific embodiment of the present invention will now be described, by way of 
example, with reference to the accompanying drawings, in which: 

25 

Figure 1 shows part of a processor array, illustrating six switching sections and the 
locations of six arithmetic logic units; 

Figure 2 is a diagram of part of the arrangement shown in figure 1 on a larger scale, 
30 illustrating one of the switching sections and one of the locations of the arithmetic logic 
units; 
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Figure 3 shows part of the processor array shown in figure 1 on a smaller scale, 
illustrating the locations of the arithmetic logic units and "vertical" busses extending 
across them; 

5 Figure 4 is similar to figure 3, but illustrating "horizontal" busses extending across the 
locations of the arithmetic logic units; 

Figure 5 shows the interconnections between the the busses of figures 2, 3 and 4 at the 
location of one of the arithmetic logic units; 

10 

Figure 6A shows in detail the circuitry of one type of programmable switch in the 
switching sections, for connecting a pair of 4-bit busses which cross each other; 

Figure 6B shows in detail the circuitry of another type of programmable switch in the 
15 switching sections, for connecting a pair of 4-bit busses which meet each other end to 
end; - 

Figure 6C shows in detail the circuitry of another type of programmable switch in the 
switching sections, for connecting carry-bit busses; 

20 

Figure 7 shows the circuitry of a series of NOR gates which may be used in the 
programmable switches of figures 5 and 6; 

Figure 8 shows a modification to the circuitry of figure 7; 

25 

Figure 9 shows a buffer and register which may be used in each switching section; 

Figure 10 is a schematic drawing illustrating how enable signals may be distributed to 
the programmable switches in the switching sections; and 

30 

Figure 11 shows in more detail the circuitry of the arrangement shown in figure 10. 
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In the following description, the terms "horizontal", "vertical", "North", "South", 
"East" and "West" have been used to assist in an understanding of relative directions, 
but their use is not intended to imply any restriction on the absolute orientation of the 
embodiment of the invention. 

5 

The processor array which forms the embodiment of the invention is provided in an 
integrated circuit. At one level, the processor array is formed by a rectangular (and 
preferably square) array of "tiles" 10, one of which is shown bounded by a thick line in 
figure 1. Any appropriate number of tiles may be employed, for example in a 16 x 16, 

0 32 x 32 or 64 x 64 array. Each tile 10 is rectangular (and preferably square) and is 
divided into four circuit areas. It is preferable for these tiles to be logically square (to 
provide symmetry in connection), although it is of less significance that they be 
physically square (this may have some advantage in providing symmetry in timing, but 
this will generally be less likely to be of significance). Two of the circuit areas 12, 

5 which are diagonally opposed in the tile 10, provide the locations for two arithmetic 
logic units ("ALUs"). The other two circuit areas, which are diagonally opposed in the 
tile 10, provide the locations for a pair of switching sections 14. 



20 



Referring to figures 1 and 2, each ALU has a first pair of 4-bit inputs a, which are 
directly connected within the ALU, a second pair of 4-bit inputs b, which are also 
directly connected within the ALU, and four 4-bit outputs f, which are directly 
connected within the ALU. Each ALU also has an independent pair of 1-bit carry inputs 
hci, vci, and a pair of 1-bit carry outputs co, which are directly connected within the 
ALU. The ALU can perform standard operations on the input signals a, b, hci, vci to 
25 produce the output signals f, co, such as add, subtract, AND, NAND, OR, NOR, XOR, 
NXOR and multiplexing and optionally can register the result of the operation. The 
instructions to the ALUs may be provided from respective 4-bit memory cells whose 
values can be set via the «H-tree" structure described below, or may be provided on the 
bus system which will be described below. 



30 



At the level shown in figures 1 and 2, each switching section 14 has eight busses 
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extending across it horizontally, and eight busses extending across it vertically, thus 
forming an 8 x 8 rectangular array of 64 crossing points, which have been numbered in 
figure 2 with Cartesian co-ordinates. All of the busses have a width of four bits, with 
the exception of the carry bus vc at X=4 and the carry bus he at Y=3, which have a 

5 width of one bit. At many of the crossing points, a 4-gang programmable switch 16 is 
provided which can selectively connect the two busses at that crossing point. At some 
of the crossing points, a 4-gang programmable switch 18 is provided which can 
selectively connect two busses which meet end to end at that crossing point, without any 
connection to the bus at right angles thereto. At the crossing point at (4, 3), a 

10 programmable switch 20 (for example as shown in Figure 6C) is provided which can 
selectively connect the carry busses vc, he which cross at right angles at that point. 



The horizontal busses in the switching section 14 will now be described. 

15 At Y=0, busses h2s are cpnnectable by programmable switches 16 to the vertical busses 
at X=0, 1, 2, 5, 6. The busses h2s have a length of two tiles and are connectable end 
to end in every other switching section 14 by a programmable switch 18 at (4, 0). 

At Y= 1 , a bus be extending from an input b of the ALU to the West is connectable by 
20 switches 16 to the vertical busses at X=0, 1. 2, 3. Also, a bus fw extending from an 
output f of the ALU to the East is connectable by switches 16 to the vertical busses at 
X= 5, 6, 7. The ends of the busses be, fw are connectable by a programmable switch 1 8 
at (4, 1). 

25 At Y =2, a bus hregs is connectable by programmable switches 16 to the vertical busses 
atX=l, 2, 3, 5, 6,7. 

At Y= 3, a bus hco extends from the carry output co of the ALU to the West to a 
programmable switch 20 at (4,3), which can connect the bus hco (a) to a carry bus hci 
30 extending to the carry input hci of the ALU to the East or (b) to a carry bus vci 
extending to the carry input vci of the ALU to the South. 
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At Y=4, a bus hregn is connectable by programmable switches 16 to the vertical busses 
atX=0, 1, 2, 3, 5, 6. 



At Y=5, busses hi are connectable to the vertical busses at X=0, 1, 2, 3, 5, 6, 7. The 
busses hi have a length of one tile and are connectable end to end in each switching 
section 14 by a programmable switch 18 at (4, 5). 

At Y= 6, a bus fe extending from an output f of the ALU to the West is connectable by 
switches 16 to the vertical busses at X=0, 1, 2, 3. Also, a bus aw extending from an 
input a of the ALU to the East is connectable by switches 16 to the vertical busses at 
X=5, 6, 7. The ends of the busses fe, aw are connectable by a programmable switch 18 
at (4, 6). , 

At Y=7, busses h2n are connectable by programmable switches 16 to the vertical busses 
at X= 1, 2, 3, 6, 7. The busses h2n have a length of two tiles and are connectable end 
to end in every other switching section 14 by a programmable switch 18 at (4, 7), 
staggered with respect to the programmable switches 18 connecting the busses h2s at (4, 
0). 



20 The vertical busses in the switching section 14 will now be described. 

At X=0, busses v2w are connectable by programmable switches 16 to the horizontal 
busses at Y=0, 1, 4, 5, 6. The busses v2w have a length of two tiles and are 
connectable end to end in every other switching section 14 by a programmable switch 
25 18 at (0,3). 

At X= 1, a bus fn extending from an output f of the ALU to the South is connectable by 
programmable switches 16 to the horizontal busses at Y=0, 1, 2. Also, a bus bs 
extending from an input b of the ALU to the North is connectable by switches 16 to the 
30 , horizontal busses at Y=4, 5, 6, 7. The ends of the busses fn, bs are connectable by a 
programmable switch 18 at (1, 3). 
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At X=2, busses vl are connectable to the horizontal busses at Y=0, 1, 2, 4, 5, 6, 7. 
The busses vl have a length of one tile and are connectable end to end in each switching 
section 14 by a programmable switch 18 at (2, 3). 

5 At X=3, a bus vregw is connectable by programmable switches 16 to the horizontal 
busses at Y= 1, 2, 4, 5, 6, 7. 

At X=4, a bus vco extends from the carry output co of the ALU to the North to the 
programmable switch 20 at (4, 3), which can connect the bus vco (a) to the carry bus he i 
10 extending to the carry input hci of the ALU to the East or (b) to the carry bus vci 
extending to the carry input vci of the ALU to the South. 

At X= 5, a bus vrege is connectable by programmable switches 16 to the horizontal 
busses at Y=0, 1, 2, 4, 5, 6. 

15 

At X=6, a bus an extending from an input a of the ALU to the South is connectable by 
switches 16 to the horizontal busses at Y=0, 1, 2. Also, a bus fs extending from an 
output f of the ALU to the North is connectable by programmable switches 16 to the 
horizontal busses at Y =4, 5, 6, 7. The ends of the busses an, fs are connectable by a 
20 programmable switch 18 at (6, 3). 

At X=7, busses v2e are connectable by programmable switches 16 to the horizontal 
busses at Y= 1, 2, 5, 6, 7. The busses v2e have a length of two tiles and are connectable 
end to end in every other switching section 14 by a programmable switch 18 at (7, 3) 
25 staggered with respect to the programmable switches 18 connecting the busses v2w at 
(0, 3). 

As shown in figure 2, the busses bs, vco, fs are connected to input b, output co and 
output f, respectively, of the ALU to the North of the switching section 14. Also, the 
30 busses fe, hco, be are connected to the output f, output co and input b of the ALU, 
respectively, to the West of the switching section 14. Furthermore, the busses aw, hci. 
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fw are connected to the input a, input ci and output f, respectively, of the ALU to the 
East of the switching section 14. Moreover, the busses fn, vci, an are connected to the 
output f, input ci and input a, respectively, of the ALU to the south of the switching 
section 14. 



In addition to these connections, the busses vregw, vrege are connected via respective 
programmable switches 18 to 4-bit connection points vtsw, vtse, respectively, (shown 
by crosses in Figure 2) in the area 12 of the ALU to the North of the switching section 
14. Also, the busses hregs, hregn are connected via respective programmable switches 
18 to 4-bit connection points htse, htne, respectively, in the area 12 of the ALU to the 
West of the switching section . 14. Furthermore, the busses hregs, hregn are connected 
via respective programmable switches 18 to 4-bit connection points htsw, htnw, 
respectively, in the area 12 of the ALU to the East of the switching section 14. 
Moreover, the busses vregw, vrege are connected via respective programmable switches 
18 to 4-bit connection points vtnw, vtne, respectively, in the area 12 of the ALU to the 
south of the switching section 14. These connection points vtnw , vtne, htne, htse, vtse, 
vtsw, htsw, htnw will be described below in further detail with reference to figures 3 to 



10 



15 



20 



25 



30 



5. 



Also, as shown in figure 2, the busses hregn, vrege, hregs,. vregw have respective 4-bit 
connection points 22 (shown by small squares in figure 2) which will be described below 
in further detail with reference to figure 9. 

Figure 3 shows one level of interconnections between the locations of the arithmetic 
logic units, which are illustrated by squares with rounded corners. A group of four 4-bit 
busses v8, v4w, v4e, vl6 extend vertically across each column of ALU locations 12. 
The leftmost bus v8 in each group is in segments, each having a length generally of eight 
tiles. The leftmost but one bus v4w in each group is in segments, each having a length 
generally of four tiles. The rightmost but one bus v4e in each group is in segments, 
again each having a length generally of four tiles, but offset by two tiles from the 
leftmost but one bus v4w. The rightmost bus v!6 in each group is in segments, each 



SIIRSTITUTE SHFFT /Rill P 7R\ 



WO 98/33277 PCT/GB98/00262 

- 11 - 

having a length generally of sixteen tiles. At the top edge of the array, which is at the 
top of figure 4, and at the bottom edge the lengths of the segments may be slightly 
greater than or shorter than specified above. 

5 Referring to figures 3 and 5, where each group of four busses v8, v4w, v4e, v 16 crosses 
each ALU location 12, four 4-bit tap connections are made at the connection points 
htnw, htsw, htse, htne. The ends of the bus segments take priority in being so connected 
over a connection to a bus segment which crosses the ALU location. 

10 Similarly, as shown in figures 4 and 5, a group of four 4-bit busses h8, h4n, h4s, hl6 
extend horizontally across each row of ALU locations 12. The uppermost bus h8 in each 
group is in segments, each having a length generally of eight-tiles. The uppermost but 
one bus h4n in each group is in segments, each having a length generally of four tiles. 
The lowermost but one bus h4s in each group is in segments, again each having a length 

15 generally of four tiles, but offset by two tiles from the uppermost but one bus h4n. The 
lowermost bus hl6 in each group is in segments, each having a length generally of 
sixteen tiles. At the left hand edge of the array, which is at the left of figure 4, and at 
the right hand edge the lengths of the segments may be slightly greater than or shorter 
than specified above. Where each group of busses h8, h4n,.h4s, hl6 crosses each ALU 

20 location 12, a further four 4^bit tap connections are made attthe connection points vtnw, 
vtsw, vtse, vtne. The ends of the bus segments take priority in being so connected over 
a connection to a bus segment which crosses the ALU location. 

As shown in figure 5, the connection points htnw, htsw, htne, htse are connected via 
25 programmable switches to the busses hregn, hregs of the switching sections to the West 
and the East of the ALU location. Also, the connection points vtnw, vtne, vtsw, vtse are 
connected via programmable switches to the busses vregw, vrege of the switching 
sections to the North and the South of the ALU location. 

30 The programmable connections 16 between pairs of 4-bit busses which cross at right 
angles will now be described with reference to figure 6A. The conductors of the 
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horizontal busses are denoted as xO, xl , x2, x3 , and the conductors of the vertical busses 
are denoted as yO, yl, y 2, y3. Between each pair of conductors of the same bit 
significance, a respective transistor 160, 161, 162, 163 is provided. The gates of the 
transistors 160, 161, 162, 163 are connected in common to the output of a NOR gate 
5 16g, which receives as its two inputs an inverted ENABLE signal from a single bit 
memory cell, which may be shared by a group of the switches, and the inverted content 
of a single bit memory cell 24. Accordingly, only when the ENABLE signal is high and 
the content of the memory cell 24 is high, the conductors xO, xl, x2, x3 are connected 
by the transistors 160, 161, 162, 163, respectively, to the conductors yO, yl, y2, y3, 
10 respectively. 

The programmable connections 18 between pairs of 4-bit busses which meet each other 
end to end in line will now be described with reference to figure 6B. The conductors of 
one bus are denoted as xlO, xll, xl2, xl3, and the conductors of the other bus are 

15 denoted as x20, x21, x22, x23. Between each pair of conductors of the same bit 
significance, a respective transistor 180, 181, 182, 183 is provided. The gates of the 
transistors 180, 181, 182, 183 are connected in common to the output of a NOR gate 
18g, which receives as its two inputs an inverted ENABLE signal from a single bit 
memory cell, which may be shared by a group of the switches, and the inverted content 

20 of a single bit memory cell 24. Accordingly, only when the ENABLE signal is high and 
the content of the memory cell 24 is high, the conductors xlO, xll, xl2, xl3 are 
connected by the transistors 180, 181, 182, 183, respectively, to the conductors x20, 
x21, x22, x23, respectively. 

25 The programmable connections 20 between the carry conductors hco,vco,hci,vci will 
now be described with reference to figure 6C. The horizontal carry output conductor hco 
is connected to the horizontal carry input conductor hci and the vertical carry input 
conductor vci via transistors 20hh, 20hv, respectively. Furthermore, the vertical carry 
output conductor vco is connected to the vertical carry input conductor vci and the 

30 horizontal carry input conductor hci via transistors 20w, 20vh, respectively. The gates 
of the transistors 20hh, 20vv are connected in common to the output of an inverter 20i. 
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and the gates of the transistors 20hv, 20vh and the input to the inverter 2Qi are connected 
to the output of a NOR gate 20g. The NOR gate 20g receives as its two inputs an 
inverted ENABLE signal from a single bit memory cell, which may be shared by a 
group of the switches, and the inverted content of a single bit memory cell 24. 
5 Accordingly, when the ENABLE signal is high, the conductors hco, vco are connected 
to the conductors hci, vci, respectively, or to the conductors vci, hci, respectively, in 
dependence upon the content of the memory cell 24. 

It will be noted that each of the switchable connections 16, 18, 20 described with 
10 reference to figures 6A to 6C includes a NOR gate 16g, 18g, 20g. As shown in figure 
7, a NOR gate 16g is typically formed by four transistors 16gl , 16g2, 16g3, 16g4, two 
16gl, 16g3 of which are responsive to the inverted ENABLE signal, and two 16g2, 
16g4 of which are responsive to the inverted content of the memory cell 24. In the 
embodiment of the invention, it is desirable that a group of the switchable collections 16, 
15 18, 20 may be disabled in common, without any need for only part of such a group to 
be disabled. Such a group might consist of all of the switchable connections in one 
switching section 14, all of the switchable connections in the two switching sections 14 
in a particular tile, or all of the switchable connections in a larger area of the array. In 
this case, the transistor 16gl may be made common to all of the switchable connections 
20 16, 18, 20 in the group, as shown in figure 8. This enables a 25% less one saving in the 
number of transistors required for the gates, but does require a further conductor linking 
the gate, as shown in figure 8. 

The man skilled in the art will appreciate that the structures depicted in Figures 7 and 
25 8 can be modified for optimisation. For example, the arrangement of Figures 7 and 8 
would not fully exploit memory cells 24 designed to return both a stored value and a 
complement of that stored value. Use of the complement obtained from such cells 24 
could be used to obviate any need for both the ENABLE and inverted ENABLE signals 
to be carried to all of the switchable connections in a group, as is the case in Figure 8. 

30 

As mentioned above with reference to figures 1 and 2, at each switching section 14, the 



e>ueeT /Bl II C OR\ 



WO 98/33277 

PCT/GB98/00262 

- 14- 

busses hregn, hregs, vregw, vrege are connected by respective 4-bit connections 22 to 
a register or buffer circuit, and this circuit will now be described in more detail with 
reference to figure 9. The four connections 22 are each connected to respective inputs 
of a multiplexer 26. The multiplexer 26th selects one of the inputs as an output, which 
5 is supplied to a register or buffer 28. The output of the register or buffer 28 is supplied 
to four tri-state buffers 30s, 30w, 30n, 30e, which are connected back to the connections 
22 to the busses hregs, vregw, hregn, vrege, respectively. In the case where a buffer 28 
is used, the 4-bit signal on a selected one of the busses hregs, vregw, hregn, vrege is 
amplified and supplied to another selected one of the busses hregs, vregw, hregn, vrege. 
In the case where a register 28 is used, the 4-bit signal on a selected one of the busses 
hregs, vregw, hregn, vrege is amplified and supplied to any selected one of the busses 
hregs, vregw, hregn, vrege after the next active clock edge. 
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It will be appreciated that the arrangement described above provides great flexibility in 
the routing of signals around and across the array. With appropriate setting of the 
switches 16, 18, 20 using the memory cells 24 and . with appropriate setting of the 
multiplexers 26 and registers or buffers 28, signals can been sent over large distances, 
primarily using the busses vl6, hl6, v8, b.8, v4e, v4w, h4n, h4s from the edge of the 
array to a particular ALU, between ALUs, and from a particular ALU to the edge of the 
array. These busses can be joined together in line, or at right angles, by the switching 
sections 14, with amplification by the registers or buffers 28 in order to reduce 
propagation delays, and with pipeline stages introduced by the registers 28. Also, these 
busses can be tapped part way along their lengths, so that the siting of the ALUs to 
perform a particular processing operation is not completely dictated by the lengths of the 
busses, and so that signals can be distributed to more than one ALU. Furthermore, the 
shorter length busses described with reference to figures 1 and 2 can be used to route 
signals between the switching sections 14 and the ALUs, and to send signals primarily 
over shorter distances, for example from one ALU to an adjacent ALU in the same row 
or column, or diagonally adjacent, even though the busses extend horizontally or 
vertically. Again, the registers or buffers 28 can be used to amplify the signals or 
introduce programmable delays into them. 
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In the arrangement described above, the memory cells 24 are distributed across the array 
to the same extent as the switching sections 14 and the ALU locations 12. Each memory 
cell 24 is disposed adjacent the switch or switches, multiplexer, register or buffer which 
it controls. This enables a high circuit density be achieved. 

5 

A description will now be made of the manner in which data is written to or read from 
the memory ceils 24, the way in which the ENABLE signals for the programmable 
switches 16, 18, 20 are written to their memory cells, the way in which instructions, and 
possibly constants, are distributed to the ALUs, and the way in which other control 

10 signals, such as a clock signal, are transmitted across the array. For all of these 
functions, an "H-tree" structure (which is known per se) may be employed, as shown 
in figure 10. Referring to Figures 10 and 11, in order to distribute an ENABLE signal 
to any of 64 locations in the example shown, the ENABLE signal 30a and a 6-bit 
address 32a for it are supplied to a decoder 34a. The decoder 34a determines which of 

15 the four branches from it leads to the address and supplies an ENABLE signal 30b -to a 
further decoder 34b in that branch, together with a 4-bit address 32b to the decoders 34b 
in all four branches. The decoder 34b receiving the ENABLE signal. 30b determines 
which of the four branches from it leads to the required address and supplies an 
ENABLE signal 30c to a further decoder 34c in that branch, together with a 4-bit 

20 address 32c to the decoders 34c in all four branches. The decoder 34c receiving the 
ENABLE signal 30c then supplies the ENABLE signal 34d to the required address 
where it can be stored in a single bit memory cell. An advantage of the H-tree structure 
is that the lengths of the signal paths to all of the destinations are approximately equal, 
which is particularly advantageous in the case of the clock signal. 

• 25 

A great advantage of the arrangement described above is that groups of the memory cells 
24 in for example one switching section 14, or in the two switching secti ons in one tile, 
or in the switching sections in a sub-array of the tiles may be disabled en bloc by the 
inverted ENABLE signals so that the contents of those memory cells do.not affect the 
30 associated switches. It is then possible for those memory cells 24 to be used as "user" 
memory by an application, rather than being used for configuring the wiring of the 
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array. 

The embodiment of the invention has been described merely by way of example, and 
many modifications and developments may be made in keeping with the present 
5 invention. For example, the embodiment employs ALUs as the processing units, but 
other processing units may additionally or alternatively be used, for example look-up 
tables, programmable logic arrays and/or self-contained CPUs which are able to fetch 
their own instructions. 

10 Furthermore, the embodiment has been described as if the whole array is covered by 
ALUs and switching sections. However, other types of section may be included in the 
array. For example, a sub-array might be composed of a 4 x 4 arrangement of tiles of 
ALUs and switching sections as described above, and the array might be composed of 
such sub-arrays and memory in a 4 x 4 array, or such sub-arrays and RISC CPUs in a 

15 4x4 array. 

In the embodiment described above, each ALU location is square, and each switching 
section is square and of the same size as the ALU locations, but it should be noted that 
the controllable switches 18 in the register busses vregw, vrege, hregn, hregs encroach 
20 into the square outline of the ALU locations. The ALU locations need not be of the same 
size as the switching sections, and in particular may be smaller, thus permitting one or 
more busses to pass horizontally or vertically directly from one switching section 14 to 
a diagonally adjacent switching section 14, for example ninning between the busses h2s , 
h2n or between the busses v2e, v2w. 



25 



30 



In the embodiment described above, each ALU has two independent carry inputs vci, 
hci and a connected pair of carry outputs co. If required, the ALUs may be arranged to 
deal with two types of carry: a fast carry between adjacent ALUs which may be of 
particular use for multi-bit adding operations; and a slow carry which can be routed 
more flexibly and may be of particular use for digital serial arithmetic. The fast carry 
might be arranged in a similar manner to that described above with reference to the 
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drawings, whereas the slow carry might employ programmable switches in the switching 
sections 14 between the carry conductor and particular bits of the 4-bit busses. 

In the embodiment described above, particular bit widths, sizes ofswitehing section and 
5 sizes of array have been mentioned, but it should be noted that all of these values may 
be changed as appropriate. Also, the programmable switches 16, 18, 20 have been 
described as being disposed at particular locations in each switching section 14, but other 
locations may be used as required and desired. 

10 In the embodiment described above, the array is two-dimensional, but the principles of 
the invention are also applicable to three-dimensional arrays, for example by providing 
a stack of the arrays described above, with the switching sections in adjacent layers 
staggered with respect to each other. The stack might include just. two layers, but 
preferably at least three layers, and the number of layers is preferably a power of two. 

15 

In the embodiment described above, the memory cells 24 can be isolated by the gates 
16g, 18g, 20g from the switches which they control so that the memory cells can be used 
for other purposes, that is put in the "user plane". The ENABLE signal memory cells, 
however, cannot be transferred to the user plane. In an alternative embodiment, the 
20 switches in a particular switching section 14 may be disconneetable from the remainder 
of the array by further switches in the busses at the boundary of that switching section 
14, with the further switches being controlled by a further memory cell which cannot be 
transferred to the user plane. 

25 Many other modifications and developments may also be made. 
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CLAIMS 

1. An integrated circuit having a field programmable circuit region arranged as a 
generally rectangular array of rows and columns of circuit areas, wherein: 
5 some of the circuit areas (12) each provide a respective processing unit for 

performing operations on data on at least one respective input signal path (an, aw, be, 
bs, hci, vci) to provide data on at least one respective output signal path (fn, fe, fs, fw, 
vco, hco); 

others of the circuit areas each provide a respective switching section (14); 
10 the processing units and the switching sections are arranged alternately in each 

row and in each column; and 

each of a substantial proportion of the switching sections provides a 
programmable connection (16, 18, 20) between at least some of the signal paths of those 
of the processing units adjacent that switching section in the same column and in the 
15 same row. 

2. An integrated circuit as claimed in claim 1 , wherein all of the switching sections 
are so arranged to provide such programmable connections. 

20 3 . An integrated circuit as claimed in any preceding claim, wherein at least one of 
the processing units and the processing units adjacent thereto each have a first input (a), 
a second input (b) and an output (f). 
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30 



4. An integrated circuit as claimed in any preceding claim, wherein a first type (18) 
of the connections provided by the switching sections are between such signal paths 
which are generally colinear with or parallel to each other. 

5. An integrated circuit as claimed in claim 4, wherein the output of said one 
processing unit is connectable: 

by such a first type of connection (18 at (4, 6)) to the first input of the next 
processing unit in one direction in the same row; 
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by such a first type of connection (18 at (6, 3)) to the first input of the next 
processing unit in the one direction in the same column; 

by such a first type of connection (18 at (4, i)) to the second input of the next 
processing unit in the opposite direction in the same row; and 
5 by such a first type of connection (18 at (1, 3)) to the second input of the next 

processing unit in the opposite direction in the same column. 

6. An integrated circuit as claimed in any preceding claim, wherein a second type 
(16) of the connections provided by the switching sections are between such signal paths 

10 which are generally orthogonal to each other. 

7. An integrated circuit as claimed in claim 6 wfyen dependent directly or indirectly 
on claim 3, wherein the output (f) of said one processing unit is connectable: 

by such a second type of connection (16 at (6, 6)) in the same column to the first 
15 input of the diagonally adjacent processing unit in said one row direction and said one 
column direction; 

by such a second type of connection (16 at (6, 1)) in the same row to the first 
input of the diagonally adjacent processing unit in said opposite row direction and said 
one column direction; 

20 by such a second type of connection (16 at (1, 1)) in the same column to the 

second input of the diagonally adjacent processing unit in said opposite row direction and 
said opposite column direction; and 

by such a second type of connection (16 at (1 , 6)) in the same row to the second 
input of the diagonally adjacent processing unit in said one row direction and said 

25 opposite column direction. 

8. An integrated circuit as claimed in any preceding claim, wherein substantially all 
of the input and output signal paths are oriented in directions substantially parallel to the 
rows or the columns. 

30 

9. An integrated circuit as claimed in any preceding claim, further comprising a 



enoeTiTUTE CUCCT mill P 26) 



WO 98/33277 

PCT/GB98/00262 

-20- 

pluraiity of inter switching section signal paths (hi, h2n, h2s, h4n, h4s, h8, hl6, vl, 
v2e, v2w, v4e, v4w, v8, vl6), each of which extends from a respective first one of the 
switching sections to a respective second one of the switching sections in the same row 
in a direction primarily generally parallel to that row, or in the same column in a 
5 direction primarily generally parallel to that column, each of the inter switching section 
signal paths being programmably connectable by the respective first switching section 
to others of the signal paths at that first switching section, and being programmably 
connectable by the respective second switching section to others of the signal paths at 
that second switching section. 
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- 10. An integrated circuit as claimed in claim 9. wherein for one type (hi , vl) of the 
inter switching section signal paths, there are' no such switching sections in the 
respective row or column between the respective first and second switching sections. 

15 11. An integrated circuit as claimed in claim 9 or 10, wherein for another type (h2n , 
h2s, h4n, h4s, h8, hl6, v2e, v2w, v4e, v4w, v8, vl6) of the inter switching section 
signal paths, the respective first and second switching sections have a number of other 
such switching sections therebetween in the respective row or column. 

20 12. An integrated circuit as claimed in any of claims 9 to 1 1 , wherein for a further 
type (h4n, h4s, h8, hl6, v4e, v4w, v8, vl6) of the inter switching section signal paths, 
each signal path has a spine portion (h4n, h4s, h8, hl6, v4e, v4w, v8, vl6) extending 
in a direction generally parallel to the respective row or column and first and second end 
portions (vtne, htne, htse, vtse, vtsw, htsw, htnw, vtnw) each extending in a direction 

25 generally orthogonal to the respective row or column and interconnecting the spine 
portion and the respective first and second switching sections, respectively. 

13. An integrated circuit as claimed in claim 12, wherein for at least some of the 
inter switching section signal paths of said further type, the respective first and second 
30 switching sections have a number of other such switching sections therebetween in the 
respective row or column. 
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14. An integrated circuit as claimed in claim 1 1 or 13, wherein at least some of said 
numbers are each one less than a power of two. 

15. An integrated circuit as claimed in claim 13, or claim 14 when dependent on 
5 claim 13 , wherein at least some of the inter switching section signal paths of said further 

type each have at least one tap portion (vtne, htne, htse, vtse, vtsw, htsw, htnw, vtnw) 
extending in a direction generally orthogonal to the respective row or column and 
interconnecting the spine portion and a respective such other switching section. 

10 16. An integrated circuit as claimed in any preceding claim, wherein at least some 
of the switching sections each include a respective register and/or buffer (28) having an 
input and an output each switchably connectable to at least some of the signal paths at 
that switching section. 

15 17. An integrated circuit as claimed in any preceding claim, wherein at least one of 
the processing units is an arithmetic logic unit. 

18. An integrated circuit as claimed in any preceding claim, wherein at least one of 
the processing units has a plural-bit input and/or a plural-bit output, and wherein at least 
20 some of the signal paths are provided by respective plural-bit busses. 



eiiacTiTiiTC CUPPT (RULE 26) 



WO 98/33277 PCT/GB98/00262 

1/8 



14 



f 



r*4 




^ *5 *a c *a 



FIG.1 



THi&r^-^«^< USPT0) 



WO 98/33277 



2/8 



PCT/GB98/00262 



14 



22 



43 

r 




FIG. 2 



= x-x eelectably 
connectable to y-y 
(Figure 6 A) 



x j x£ = xJ eelectably 

connectable to x2 
(Figure 6B) 

s input to or output from ALU 



fa cl = hco, vco eelectably 
connectable to hci, vol 
or vcu hci (Figure 6C) 

= permanent connection to long 
bueeee (Figuree 3 to 5) 

- permanent connection to 
regieter/buffer (Figure 9) 



THIS PAGE BLANK <uspto) 



WO 98/33277 PCT/GB98/00262 




FIG. 



3 



THIS PAGE BLANK (uspto) 



WO 98/33277 PCT/GB98/00262 

4/8 





iii iii iii ill iii iii ill lit ill 
ill iii iii iii iii iii iii i it • iii 




THIS rmt mjm <M*«» 



THIS PAGE BLANK (uspto) 



WO 98/33277 PCT/GB98/00262 

6/8 




<3 PAGE BLANK (usfto) 



WO 98/33277 



PCT/GB98/00262 

7/8 




F/6 9 



X 30a, 32a 




FIG. 10 



, rilS PAGE BLANK (uspto) 



i*AGE BLANK 



INTERNATIONAL SEARCH REPORT 



Into octal Application No 

PCT/GB 98/00262 



A. CLASSIFICATION-OS SUBJECT MATTER 

PC 6 H03K19/177 G06F 15/78 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Mlr^umdocumertaWri searched (dassltlcatJon system followed by classincatton symbols) 

IPC 6 HOSft ^06F 



Documentation searched other than minimum do cum entatton to the extent that such documents are Included in the fields searched 



Electronic data base^corisulted during me international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



US 5 233 539 A (AGftAWAL OM P ET AL) 3 
August 1993 

see the whole document 

WO 95 22205 A (XILINX INC) 17 August 1995 
see the whole document 

US 5 208 491 A (EBELING WILLIAM H C ET 
AL) 4 May 1993 

see column 1, line 62 - column 3, line 28 
see column 9, line 57 - line 68; figure 1 

US 5 204 556 A (SHANKAR KAPIL) 20 April 
1993 

see the whole document 



1-18 

1-18 
1-18 



□ 



Further documents are listed In the continuation of box C 



Patent family members are listed in annex. 



• Special categories of cited documents : 

"A" document defining the general state of the art which is not 

considered to be of particular retevance 
"E" earlier document but published on or after the international 

filing date 

"L" document which may throw doubts on priority ciaim(e) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O* document referring to an oral disclosure, use, exhibition or 
other means 

"P* document published prior to the international filing date but 
later than the priority date claimed 



T" later document published after the International filing date 
or priority date and not In conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

•X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document Is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to Involve an inventive step when the 
document ts combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
In the art. 

document member of the same patent family 



Oate of the actual completion of me international search 



6 May 1998 



Date of mailing of the international search report 



15/05/1998 



Name and mailing address of the ISA 

European Patent Office. P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Authorized officer 



Michel, T 



Form PCTASA^lO (second sfteot) (July 1992) 



INTERNATIONAL SEARCH REPORT 



Information on potent family members 


into: on at Application No 

PCT/GB 98/00262 


Patent document 
cited in search report 


Publication 
date 


Patent family 
member(s) 


Publication 
date 



us 
us 
us 

EP 
EP 
JP 
JP 
JP 
US 
US 
US 
US 
US 
US 
US 

us 
us 
us 
us 
us 



5644496 

5260881 

5621650 

0415542 

0744835 

3078317 

3079125 

3079126 

5329460 

5359536 

5422823 

5490074 

5586044 

5469368 

5185706 

5231588 

5255203 

5587921 

5598346 

5740069 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 



01-07-97 
09-11-93 
15-04-97 
06-03-91 
27-11-96 

03- 04-91 

04- 04-91 
04-04-91 
12-07-94 
25-10-94 
06-06-95 
06-02-96 
17-12-96 
21-11-95 
09-02-93 

27- 07-93 
19-10-93 
24-12-96 

28- 01-97 
14-04-98 



WO 


9522205 


A 


17-08-95 


EP 


0698312 


A 


28-02-96 










JP 


8509344 


T 


01-10-96 










us 


5682107 


A 


28-10-97 


us 


5208491 


A 


04-05-93 


us 


5367209 


A 


22-11-94 


us 


5204556 


A 


20-04-93 


DE 


583361 


T 


16-03-95 










EP 


0583361 


A 


23-02-94 










JP 


6510403 


T 


17-11-94 










WO 


9220159 


A 


12-11-92 



Form PCT/ISA/210 (patent tamily annex) (July 1992) 



